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ABSTRACT 

A model for genetic algorithms with semantic nets is derived for which the 
relationships between concepts is depicted as a semantic net An organism 
represents the manner in which objects in a scene are attached to concepts 
in the net. Predicates between object pairs are continuous valued truth 
functions in the form of an inverse exponential function (e'P ixl ). l:n 
relationships are combined via the fuzzy OR (Max [...]). Finally, 
predicates between pairs of concepts are resolved by taking the average of 
the combined predicate values of the objects attached to the concept at the 
tail of the arc representing the predicate in the semantic net. The method 
is illustrated by applying it to the identification of oceanic features in the 
North Atlantic. 

keywords: genetic algorithms, feature labelling, semantic nets, fitness 
functions 
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BACKGROUND 

Genetic algorithms are a problem solving method 
requiring domain*" specific knowledge that is often 
heuristic. Candidate solutions are represented as 
organisms. Organisms are grouped into populations 
known as generations and are combined in pairs to 
produce subsequent generations. An individual 
organism' s potential as a solution is determined 
by a fitness function. 

Fitness functions map organisms into real numbers 
and are used to determine which organisms will be 
used (and how frequently) to produce offspring for 
the succeeding generation. Fitness functions 
often require heuristic information because a 
precise measure of the suitability of a given 
organism (i.e., solution) is not always 
attainable. An example is the recognition (i.e., 
labeling) of segments in a scene. General 
characteristics of objects in the scene such as 
curvature, size, length, and relationship to each 
other may be known only within broad tolerance 
levels. That is, there is great variability in 
the relationships among objects in different 
scenes . 

Semantic nets (SNs) are effective representations 
of binary relationships between concepts (e.g., 
objects in a scena) . SNs denote concepts via 
nodes in a directed graph. The arcs are labelled 


We introduce here a representation 


Because relationships (i.e., predicates) relating 
concepts are not precise, their evaluation is n 
the form of a truth functional with range [0, ] 
rather than the traditional (0,1). That is, we 
use fuzzy logic [YA75, ZA88, ZI85] to combine 
heur istically the information concerning a 
particular organism. Thus, we derive genetic 
algorithms with fuzzy fitness functions (GA/F ) . 


GENETIC ALGORITHMS 

Genetic algorithms (GAs) are search procedures 
modelled after the mechanics of natural selection. 
Shey differ from traditional search techniques in 
several ways. First, GAs have the property of 
implicit parallelism, where the algorithm is 
equivalent to a search of the hyperplanes of the 
search space, without directly testing hyperplane 
values [H075, G088] . Nearly optimal results have 
been found by examining as few as one point for 
every 2 35 points in the search space [G086] . 
Second, GAs are randomized algorithms, using 
operations with nondeterminist ic results. The 
results for an operation depend on the value of a 
random number. Third, GAs operate on many 
solutions simultaneously, gathering information 
from all current points to direct the search. 
This factor mitigates the problems of local maxima 
and noise. 

From a mechanistic view, genetic algorithms are a 
variation of the generate and test method. In 
pure generate and test, solutions are generated 
and sent to an evaluator. The evaluator reports 
whether the solution posed is optimal. In genetic 
algorithms, this generate and test process is 
repeated iteratively over a set of solutions . The 
evaluator returns information to guide the 
selection of new solutions for following 
iterations . 

GA terminology is taken from genetics. Each 
candidate solution examined is termed an organism, 
traditionally represented as a list. The set of 
organisms maintained is termed a population, and 
the population at a given time is termed a 
generation. Each iteration envolves three steps. 
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First, each organism in the current generation is 
evaluated, producing a numerical fitness function 
result. The criteria for evaluation is domain 
specific information about the relative merit of 
that particular organism. Better organisms are 
assigned higher fitness function values. Second, 
some organisms are selected to form one or more 
organisms for the next generation. Specifically, 
the number of copies of each organism selected is 
directly proportional to its fitness function. 
Third, some of those organisms selected are 
modified via genetic operators. Each genetic 
operator takes the chosen organism(a), and 
produces a new organism(s) . The most common 
genetic operators include crossover and mutation. 
This iterative procedure terminates when the 
population converges to a solution. 

The crossover operator takes two organisms 
selected and combines partial solutions of each. 
When organisms are represented with lists, single 
point crossover can be viewed as combining the 
teft hand side of one organism chosen with the 
right hand side of the other, and conversely. 
This creates two offspring. The crossover point, 
that point where the crossover takes place, is 
randomly determined. 

The mutation operator uses a minimal change 
Strategy. It takes a selected organism, and 
changes the value at one randomly determined 
position. This corresponds to a tight local 
search. The offspring produced is identical to 
the parent except at the mutation point. 


GENETIC ALGORITHM PROBLEM MODEL 
FOR OCEANIC FEATURE LABELING 

Scene recognition is an application for which the 
GA model we propose is suited. For example. Fig. 
1 (a) is a segmented image of the North Atlantic 
which Fig. 1(b) is the original image. The 
lines (referred to here as segments, * 1# s 2 , ...) 
represent boundaries between warm and cold regions 
of sea water. The problem is to classify the 
segments as Gulf Stream North Wall (NW) , Gulf 
Stream South Wall (SW) , cold eddies (CE) , warm 
eddies (WE), continental shelf (CS) , and "other" 
( 0 ). 

Relationships which can be expressed as furry 
truth functions are known to exist within or 
between classifications . Principal among these 
are (1) the average width of the Gulf Stream is 50 
kilometers, (2) the average diameter of an eddy is 
100 kilometers, (3) cold eddies are usually south 
of the Gulf Stream, and (4) warm eddies^are 
usually north of the Gulf Stream. To these" one 
must add the trivial (yet necessary) relationships 
such as the south wall is at a lower latitude than 
the north wall and the known geophysical 
coordinates of continental shelves . 

A scene consisting of classif ication categories 
( cat ^ , cat 2 , cat n ) and relationships 

expressed as truth functions (p P^ 2 ^, . ..) 
between categories can be modelled 1 as a semantic 
net (or, more precisely, an association list) . A 
generic one is shown in Fig. 2. Segments are 



Figure 1, Oceanic Features (North Atlantic) 
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Figure 2. Generic Semantic Net for Oceanic Features 


attached to the categories via the INST (instance) 
relation. An allele (or gene) is a category na roe. 
An organism is a list of categories, one allele 
for each segment. For example, given six segments 
then (NW, NW, SW, CS, CE,0) and (CE, SW, CE, O, 0, 
CS) are representative organisms. Formally, let 
an association list be defined as A * <V, F> where 
V = ( cat i , cat 2 f . .*f cat-) ia a 8#t o£ 
categories, and P - (^i j (g) I i'j 1 * * 
1,2,..,, r^j) is a set of binary predicates. 

These predicates describe the relationships 
between categories and the ideal relationship 
between segments assigned to these categories. 

Let an organism for spatial labeling is defined as 
Q ■ <S, INST>, where S - (s x , s 2 , ---# is a 
set of segments, and INST: S -> v is a function. 

Crossover Operators 

There are three applicable crossover operators. 
These include single point crossover, two point 
crossover, and varying multiple point crossover 
[ BO 8 7]. Crossover operators require the 
inposition of a total order on the segments in S. 
Let sjl < sj if i < j; s ± - Sj if i - j; «i > »j i£ 
i > j. Denote by INST Qi the instance mapping for 
organism 0 X . 

Single Point Crossover. Given < s x , s 2 , . * n >r 
choose a random integer k, 1 < k < n. For parent 
organisms 01 and 02 create an offspring, O r , such 
that 


f INST 01 ( 85 ^) if i £ It 

inst 0 , (Si) - i 

^ INST o2 (Si) if i > k 

Two Point Crossover. Let < s x , s 2 , ..., a n > be a 

circular list. Formally, succ(Si) * »i+i 
(pred(s i+1 ) - Si) if i < n and succ(s n ) - s x 
(pred(s x ) - s n ) . Choose two random integers, kl 
and k2 7 For parent organisms O x and 0 2 create an 
offspring. O' such that 


f INST 01 (S t ) if Si £ { 8 kl , succ(s JtX 5, 

INST 0 , (Si)- -*.f P r « d (»k2> * 

^ INST o2 (s x ) otherwise 

Varying Multiple Point Crossover. For parent 
organisms 0 X and 0 2 , create an offspring O' such 
that 

f INST 01 (Si) with probability 0.5 

INST/y (8 t ) » 4 a * 

^ INSTq 2 (*i) with probability 0.5 


Mutation Operator 

Our mutation operator selects one segment randomly 
and assigns it to a randomly determined category. 
Choose two random integers kl, 1 < kl < n, and k2, 

1 < k2 < m . Remove s kX from its current category 
in* organism O and attach it to cat k2 (i.e., set 
INST 0 (s kl ) - cat k2 ) . 

Fitness Function 

For the model, the fitness function is the sum of 
all satisfied predicates in the semantic net. Let 
E denote the function. Let P x '9', be defined as 
above, with m possible categories. Then 


mm- 

r ij 

E - X £ E 
j-i i -1 g -1 




<sr> 


(i) 


p (g) i® a predicate for a relationship between 
categories, i and j. Each predicate p iV g) haa a 
corresponding derived predicate, pred X j (h, 1 ) , 
for an analogous relationship between segments s k 
and Si, where s k is in category i and s x is in 
category j . P x j W is interpreted based on the 
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normalized truth value of the derived predicate. 
Specifically, 

f Z Z Pred i j (k, i) 

[ -£l-^k (2) 

1 Icatil x 1 cat j | 

O otherwise 

where |catj_| and |catj| are the number of segments 
classified as category i and category j, 
respectively. Because all such predicates are not 
defined between all possible pairs of segments, 
the normalising factor (the denominator) is 
subject to redefinition on a case by case basis . 
Alternatives to (2) are described following the 
description of derived predicates below. 

An example of a fussy predicate Pw W from our 
domain is the relationship “is near", where 
category i "is near" category j. The 
corresponding derived predicate predw <g) (if 1) 
describes the relationship between two segments, 
s k *** category i and s x in category j. The sum of 
P r ®^ij g (h,l) for all possible pairs of segments 
•k ana s^ is normalised by the maximum possible. 

Definitions of pred^ j W (k, i) are dependent on the 
underlying semantics of the problem domain. One 
*PP r oach is to define them propositionally as 
(0,1) if a measurable relationship between s^ and 
*1 la within or beyond some threshold. A second 
approach preferred here is to define th em as fussy 
truth functions on the interval [0,1]. Inverse 
exponential truth functions are commonly used in 
fussy set theory to measure the "nearness" of two 
concepts. An alternative nearness measures are in 
[ZI85J . For example, if the description of S> Ai W 
contains a nominal value (e.g., the SW is 
approximately 50 kilometers from the NW) then let 
x o represent the nominal value and 


pred^*(k, 1) « a -pJXo-X| 




where 

X is the observed value corresponding to the 
same measure (distance, curvative, angle of 
declination) between s^ and s^ 

P is a constant contrast factor in [0,1] 
which emphasises the magnitude of the 
between the observed and nominal 
value when increased 

many situations for which the nearness 
measure is not bounded by an ideal but the closer 
to s k the better. In such cases, X Q can be 
replaced by sero in formula (3) . 


Such relationships can be considered as ordinary 
propositional truth values. 

(g) ( 1 ®k and a i »o related 

P ij <k ' 1> " n <5) 

J ^ 0 otherwise 


If there is a measure X associated with the 
relationship and X k > x x when the condition is 
met, the derived predicate of formula (5) can be 
represented by the ceiling function 


PEe i j ? " r (X k‘ X l > 1 < I ■ X k -Xi I +1 ) 1 


(6) 


for P ( S> 13 , each objact attached to ca^ ra<juiraa 
jeat^j evaluations of pred^) ± . . The multiple 
evaluations are combined to a single value using 
fuzzy OR 


max lpr«3 ) , i) ] ; 


for each in cat^ 


(7) 


This corresponds to finding the best segment, s x , 
that matches the relationship for a given segment 
*k- B y contract, the combination rule 


min rpreS^jk , 1)]; for each s k in cat 1 (8) 

corresponds to fuzzy AND. The heuristic implied 
by the formula (2) is 


Z pred^jk,!) /{catj [ ; for each s k in ca^ (9) 


which corresponds to the average truth functional 
value of S]^ with ell s^ segments in cat j . 

Let Z* g ^ij<k) stand for the segment level 
combination rule, (7) , (8) , or (9) . Possible 

aggregation rules to compute P i j (?) are 


J« (i^jk)] 

min [i®Jk)] 


( 10 ) 

( 11 ) 

( 12 ) 


"Not near" or "as distant as possible" may be 
measured by the fuzzy complement of (3) . 


which correspond to average, best, and worst 
match, respectively. The aggregation rule of 
formula (10) ia the one implied by formula (2) . 


pt *ij ) (k ' l) " 1 ■ f ° «> 

*0 the right side of formula (3) , 

8ome relationships such as "above" or "smaller" 
are not easily modelled as nearness measures. 


EXAMPLE 

Fig. 3 is a reproduction of Fig. l (a) with moat 
segments labelled (correctly) . Eight segment* are 

* 2 9 ■ * • ' ®8 an< * ar ® used below in 
an example. Table 1 lists and defines all 
predicates and derived predicates required for the 
•emantic net of Fig. 2. The notation |cat h | 
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Table 1 . Predicate Descriptions 


p(l) 

max [exp(-0.5 *)l/lc°or| 

01,01 

X 

D (*> 

mx [exp(-0.5 x))/(|cat c# |-l) 

P C*,C» 

x where k#l 

p(l) 

(1/ loat^ I ) E[exp<-0 . 5 1 100-xl ) ] / loat^ | 

we, we 

X 

P< 2 > 

we, we 

max [exp (-0 . 5 x) ] / ( |cat w- | -1) 
x where k*l 

p u> 


we,nw 

X 

1 

pU> 

MX [exp (-0.5 x) ] / |cat nw l *1) 

nw,nw 

x where k#l 

pU> 

(l/|cat iW |)It**p('0-5|50-x|)]/|oat nit | 

nw,*w 

X 

P (2) 

(l/|c«t,.|)I r [<X k -X 1 )/(|X k -*ll +1 » 1 

aw, sw 

X 

1 

pd) 

aw, aw 

■ax [•*P(“0- 5 x) J /I oat sw | -1) 
x where k*l 

P (1) 

(l/|oat M |)I T ( 

aw,ce 

X 

1 

pd) 


ce,c« 

X 

p<2> 

ft** [exp (-0.5 x) ) / 1 (cat oa | -1) 

P ce, ce 

x where k*l 

p(D 

■ax [exp(-0.5 x) ] /| (cat Q | -1) 

P o,o 

x where k#i 

p<*> 

(1/ ]oet # | ) E [ 1-exp (-0 . 5x) J/|cat G l 


X 


Daacrlptlon 

near known CS coordinates (distance - a) 
near other CS segment (distance * x) 

UK diameter near 100 km (distance “ x) 
near other HE segment (distance * x) 

WE north of NW (X k and X x are latitudes) 
near other NW segment (distance - x) 

NW 50km from SW (distance - x) 

MW north of SW (X k and X x are latitudes) 
near other SW segment (distance * x) 

SW north of C* (X k and X x are latitudes) 
d diameter near 100 km (distance - x) 
near other CK segment (distance * x) 

near other O segment (distance - x) 
not near CS, WH, CK, NW, or SW 


refer* to the number of segments that are an 
instance of category h. The value 0.5 ia c ose 
arbitrarily for P in all derived predicates. The 
exponential form of derived predicates is used for 
all relationships except "north of" where formula 
(6) ia substituted. The default value for any 
predicate or derived predicate is zero should a 
denominator evaluate to xero . 

The eight segments distinguished in Fig, 3 are 
characterised in Table 2. For this example, we 
need only the geophysical coordinates, the 
distances between segment centroids, and the 
distances between the closest points of segments. 
X larger, more complete description might also 
contain the length and degree of curvature of each 
segment . 

Table 3 lists six organisms together with their 
fitness function values which are computed using 
the predicates in Table 1. The fitness function 
ie given by formula (2) . The combination and 
aggregation rules are formulas (7) and (12>, 
respectively . Derived predicates are variations 


of formulas <3) and (4) except "north of", which 
is represented by formula (6) with the requisite 
measure being latitude. Organism O x has no 
segments labelled incorrectly. 0 2 has two 
segments labelled incorrectly. 0 3 through 0 6 have 
3, 3, 5, and 8 incorrectly labelled segments, 
respectively. The fitness function values 
correspond roughly to the correctness of the 
labelling. Additional predicates (i.e., a more 
complex semantic net) would improve upon the 
ordering and separation in most cases. 

CONCLUSION 

A model for labelling complex scenes via genetic 
algorithms with fuzzy fitness functions evaluated 
over semantic nets and GAs is possible. Truth 
functionals indicating the degree to which 
specific interfeature relationships are fulfilled 
are combined at the segment level then aggregated 
at the category level using fuzzy set operators . 

We are currently investigating such issues as the 
effect of many predicates clustered on one or two 
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[TB86] 


°i 

°2 


1* 3. Fitness Function Values 
Selected Organisms 

for 

[YA75] 

<NW sw nw sw ca ca ca ca> ; 

*<°]> " 

2.2098 


<sw sw ww ww ca ca ca ca> ; 

«<°2) “ 

2.2511 

[ZA88] 

<sw sw ww mw ca ca mw sw> ; 

■ 

o 

M 

2.1251 

[ZI85] 

<sw SW MW ca mw ca ca ca> ; 

: «(0 4 ) - 

1.4731 


<mw mw ca ca sw mw sw cx> 

; t(0^ - 

1.6757 


<sw ca sw ca sw mw sw hw> 

; *<o f ) - 

0.9235 
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categories , alternate forms for the truth 
functionals themselves, and the crossover rules. 
Our image set consists of six segmented infrared 
photographs of the North Atlantic, each photograph 
having a different degree of observation. Our 
testbed will consist of a GA algorithm capable of 
manipulating the alleles' correspondence to the 
semantic net . 
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